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This invention relates to the development of recombinant eukaiyotic don- 
ing and expre sion vectors based on unique regulatory elements isolated from 
autonomously replicating, stable episomal units from human tumor cell lines. 
More specifically, the unique regulatoiy elements relate to origins of replication, 
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and expression vector will accommodate genes that exceed the cosmid limit 
(greater than 50 kb) and permit their maintenance as autonomously replicating 
extrachromosomal elements in mammalian cells. Inclusion of telomeres and cen- 
tromeres would control the replication and segregarion and therefore serve as an 
eventual vehicle for gene replacement therapy. This invention is therefore unique 
m providing for the expression and autonomous replication of large genes, 
maintained extrachromosomally, in a vector containing episomal regulatory ele* 
ments. 
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A EUKARYOTIC EPZSOHAL DNA CLONING AND EXPRESSION VECTOR 

10 

This Inven'tion relates "to development of 
recombinant eukaryotic cloning and expression vectors 

15 based on uniqpie regulatory elements isolated from 

autonomously replicating, stable episomal units isolated 
from human tumor cell lines. More specifically, the 
unic[ue regulatory elements include origins of DNA 
replication, and DNA sequences that confer 

20 extrachromosoxoal stability and maintenaince. These unique 

episomal regulatory elements permit large pieces of DNA to 
be expressed or cloned (greater than 50 kilobase pairs 
[kb] in size). 

25 During the past decade, the underlying significance 

of recent advances in molecular biology has been the 
ability to clone and manipulate DNA from virtually any 
source by ligating restriction fragments into phage or 
plasmid vectors which are then replicated in E. coll. 

30 

Since then, a crucial technological gap has developed 
in what is commonly called "conventional recombinant DNA 
technology." This technological gap stems from two 
developments. The first is the discovery that many 
35 eukaryotic genes are encoded by enormous lengths of DNA. 

The second is an optimistic and enthusiastic goal of 
mapping and sequencing entire genomes, including the hxman 
genome. 
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Because of tlie lar? size f DNA in many genes from 
higher organisms, Uils size liml1:atlon and res'trlctlon can 
be stifling. For example, blthorax locus in Dropsophil^, 
which plays an active role in the fly's segmentation 
5 pattern, encompasses approximately 320 kb (Kerch, et al.. 
Cell 43:81, 1985). Factor VIII gene in the human which 
encodes the blood-clotting factor deficient in 
hemophiliacs, spans at least 190 )cb (Gitschler, et al.. 
Nature (Iiondon^ . 312:326, 1984). The gene that is 

10 defective in Duchenne's muscular dystrophy is estimated to 
include more than a million base pairs (1000 kb) • A 
striking feature of this gene is the protein-coding 
portion may be encoded by as little as 15 kb of DNA 
(Monaco, et al. Nature (liOndon^ - 302:575, 1983). Thus, 

15 there is a strong need for technological advances which 
permit the cloning and expression of very large genes. 

Also widening this technological gap is the increased 
interest in and enthusiasm for gene replacement therapy. 

20 Proposals to use genes to treat cancer and immune 

deficiencies have only recently been approved by the 
National Institutes of Health human gene therapy 
subcommittee and the Recombinant DNA Advisory Committee 
( Science . 249:974, August, 1990). These first studies 

25 focus on: 

(1) delivering tiimor necrosis factor (TNF) directly 
to a tiimor site in much larger doses by 
packaging the gene for TNF inside special 

30 lymphocytes that have a natural 

affinity for tumors; and 

(2) attempting actual gene replacement therapy in 
children with a rare, inherited and often lethal 

35 immune system disorder caused by adenosine 

deaminase deficiency. A normal healthy 
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recoxnbinantly pr duced ADA g ne will be 
inteoduced Into the white blood cell of an ADA 
deficient child and the cells are then returned 
to the patient {16^ 975) . 

5 

To narrow this gap, molecular biologists are 
attempting to clone large pieces of exogenous DNA into 
co2Dpatible hosts by means of artificial vectors. However, 
standard recombinemt DNA techniques, that involve the 

10 construction of small plasmid vectors that can be 

transfected into host cells and clonally propagated, are 
limited in the amount of exogenous DNA that can be 
"squeezed** or inserted into these vectors. These size 
restrictions only permit about 50 kilobase pairs (kb) to 

15 be cloned into the vectors usually employed in cloning. 

More limitations exist when the discussion turns to 
the bacterial expression of mammalian proteins. The 
current technology for expressing mammalian proteins in 
20 bacteria is hampered with problems relating to post 

translational modifications and functional bioactivity. 

To date, cloning of large segments of exogenous DNA 
in the range of several hundred kilobase pairs has only 

25 been achieved by employing yeast. This was done by 

ligaring exogenous DNA to vector sequences that allow 
their propagation as linear artificial chromosomes (Burke, 
et al. Science . 236:806, 1987). Although this technique 
is a significant step towards resolving this size 

30 restriction, cloning large segments of exogenous DNA into 
yeast is not without limitations. Questions and concerns 
about this technology pertain to (1) the stability of the 
recombinants, (2) whether clone banks are representative 
of the starting material, (3) whether the desired protein 

35 is consistently expressed in extrachromosomal vectors, and 
(4) whether normal human transcripts are properly 



wo 92/07080 



PCr/US91/07690 



-4- 

processed in yeast, as well as, whether proper express! n 
and post trans lational modification of the recombinant 
protein occurs in yeast. 

5 Therefore, with the yeast expression system and its 

limitations, there is still a very strong need to design 
and construct eukaryotic expression and cloning vectors 
possessing the capabilities of housing very large regions 
of DNA (greater than 50 kb) and of accurately processing 
10 and expressing of these large genes. With such a novel 
vector, large regions of DNA that span genes can then be 
cloned and whole proteins encoded by the genes can then be 
expressed. 

i5 One mechanism by which a cell can accumulate large 

amotmts of specific protein or RNA is by amplification of 
the respective gene. This amplification may be located on 
either expanded chromosomal regions (homogenous staining 
regions) or on extrachromosomal autonomously replicating 

20 elements (called double minute, double minute chromosomes 
or episomes) « 

Episomes have unique featvures; the most notable are 
that episomes autonomously replicate and are stably 

25 maintained extrachromosomally. The characteristics of 
episomes permits the continuous production of the 
respective asqdlified gene and the gene products it 
encodes. For example, an episome produced in hamster 
cells has been characterized to contain amplified amounts 

to of a transfected CAD (CAD is an acronym for the 

multifunctional protein containing carbamylphosphate 
synthetase, aspartate transcarbamylase, and 
dihydroorotase) gene at high frequency (Carrol, et al., 
MolecuXar and cellular Biology, 7(5):1740, 1987). The 

5 anqplif ied CAD gene is produced with each division of each 

cell. 
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Viral episomes have also been identified* It has 
been dem nstrated that papilloma viral OKA r plioates lik 
a plasmid in mouse cells, circular bovine papilloma virus 
(BPV) DNA can transform certain mouse cell lines to a 
5 malignant phenotype. In these transformed cell lines, th 
BPV DNA remains circular and extrachromosomal at about 30 
- 100 copies per cell. This "plasmid" is being stably 
maintained in higher eukaryotes. Desired genes may be 
inserted into the BPV DKA and be maintained in the 
10 plasmid-^like state and high levels of mRNA and protein 

corresponding to the desired gene can be produced. It has 
also been shown that Epstein-Barr virus vectors contain 
sequences that provide extrachromosomal stability of 
episomal DNA as well as origins of replication. This 
15 viral vector has been used to identify human DNA sequences 
that permit autonomous replication in human cells (Krysan, 
et al.. Molecular and Cellular Biology, 9(3}:1026, 1989). 
But, it can be appreciated that there ve many limitations 
when working with a virally produced protein. For 
20 example, in terms of producing proteins that may 

ultimately be used to replace defective human genes, viral 
episomes probably are not feasible because of potential 
Food and Drug Administration regulations, etc. Also the 
viral episome eventually integrates into chromosomal sites 
25 which then interferes with continued amplification and 
causes the expression of its resident genes to be 
extinguished • 



Thus, the limitations in terms of integration into 
30 chromos mal sites and of potential hazards pertaining to 
the use of viral based vectors for amplification and 
expression apply to all eukaryotic viral episomes. 

It is the intent of this invention to describe a 
35 eukaryotic cloning and an expression vector which will 
accommodate genes that exceed the cosmid limit (greater 
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than 50 kb) and permit th ir accumulation and maintenance 
as autonomously replicating extrachr mosomal elements in 
mammalian cells. This invention is therefore unique by 
providing autonomous replication and expression of large 
5 genes in a vector containing episomal regulatory elements. 

This minimal cloning or expression vector will be 
further modified by the inclusion of regions of human 
chromosomes containing telomeres and centromeres. This 
would thus create a human artificial chromosome that would 
be subjected to the same control mechanisms (regarding 
regulation and chromosomal segregation) as normal 
chromosomes and therefore serve as a vehicle for gene 
replacement therapy. This modification of the 
extrachromosomal vector is therefore unique in that it 
will be a synthetic chromosome containing genes of choice, 
that will be expressed, and that will be maintained and 
regulated as if it were a normal chromosome. 

This cloning or expression vector may take on several 
forms. For example, two principal forms for employment 
are: (1) employed via extrachromosomal /episomal, 
autonomous replication and segregation which could even be 
amplified, and (2) employed via a human artificial 
chromosome under normal chromosomal control mechanisms. 

In general and overall scope, the present invention 
relates to the development of recombinant eukaryotic 
cloning and expression vectors based on unique regulatory 
30 elements isolated from autonomously replicating, stable 

episomal units isolated from human tumor cell lines. More 
particularly, these unique regulatory elements include 
origins of DNA replication, and DNA sequences that confer 
extrachromosomal stability and maintenance. These unique 
35 episomal regulatory elements will permit large pieces of 
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DNA t be expressed or cloned (greater than 50 kil bases 
pairs in size) • 

This invention discloses procedures for producing two 
5 different types of vectors. One is a cloning vector and 

the other one is an expression vector. For the purpose of 
this invention, the phrase "cloning vector** refers to a 
DKA vector designed to be used to clone a desired gene* 
The techniques that are involved in cloning vary from 
10 vector to vector and from system to system, however, these 

techniques in general are standard and known to those 
skilled in the art of recombinant DNA technology. 

Also, for the purpose of this invention, the phrase 
15 ''expression vector** refers to a DNA vector capable of 

replication in selected mammalian host cells and 
expressing a desired protein. This protein may then be 
recovered from the cells by employing techniques known to 
those skilled in the art. 

20 

This cloning vector should include one or more 
functional origins of DNA replication to permit stable, 
autonomous replication. The phrase "origin of 
replication*' is defined as a region that indicates the 
25 origin of replication. 



This cloning vector should include appropriate DNA 
sequences that confer extrachromosomal stability and 
maintenance. The sequences responsible for conferring 

30 extrachrom somal stability and persistence may be related 
to sequences responsible for nuclear matrix attachment 
sites, topolsomerase II reaction sites, and/or other 
regions required for appropriate Interactions with the 
nuclear architecture. This extrachromosomal stability and 

35 maintenance permits the introduction of large exogenous 
genes. 
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This cloning vector sh uld also include DNA 
selectable marker sequences tihat can be used to confer 
drug resistance to a transf ected cell or DNA sequences 
that can correct a genetic mutation. This allows the 
5 cells that were transfected with the vector to be selected 
for. The DNA selectable marker segment confers upon a 
cell transfected with said vector, the ability to survive 
in the presence of a selected cos^oxind or selected group 
of compounds. The compound may be either G418 or 

10 hygromycin B. Also, other selectable marker segments will 

contain DNA encoding an enzyme capable of functionally 
replacing a mutated enzyme so as to render the transfected 
cell resistant to said selected compound or selected group 
of compounds. The enzyme may be selected from a group 

15 consisting of: thymidine kinase, xanthineguanine 
phosphoribosyl transferase, adenine 
phosphor ibosyltransf erase, adenosine deaminase and 
dihydrofolate reductase. 

20 This cloning vector should also include a multi-use 

multiple cloning site to facilitate recovery for genetic 
modification and analysis and insertion for reintroduction 
into cells for replication and expression. Multiple 
cloning cassette seqnience cartridges are commercially 

25 available from several different companies (Promega, New 

England Biolabs, etc) . A typical cassette sequence would 
include restriction sites for 8-11 different enzymes 
(i.e. £co RI, Sac 1, Sma 1, Ava I, Bam HI, Xba 1, Hinc II, 
Acc 1, Sal 1, Pst 1, Hind III, etc.) The availability of 

30 these cassette sequences are known to those skilled in the 
art. 

This cloning vector should also include a DNA segment 
encoding bacterial components necess£ury for propagation of 
35 said vector in bacteria. Bacterial components that are 
essential for propagation of the cloning vector in 
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bacteria are known t:o those skilled In this art. For 
exaaple, two bacterial components essential for bacterial 
propagation are a repllcon that is responsible for 
initiation of replication and antibiotic resistant markers 
5 (i.e. ampioillin, tetracycline, etc.) that permits growth 

in specific antibiotics. 

In addition to the above described five different 
components included in the unique cloning vector, a unique 
10 expression vector capable of expressing large pieces of 

DMA (40 - 400 kb) should also include, a promoter, a 
polyadenylation site and a splice site in spacial relation 
to allow efficient expression of a structural gene. 

i5 The choice of promoters to be included in this vector 

will depend on the mammalian host cell employed. It is 
advantageous to employ a compatible promoter with regard 
to the cells that the desired protein will be expressed 
in. The inventors prefer to employ promoters derived from 

20 the following genes (although other promoters would be 
satisfactory): cytomegalovirus, SV-40, Rous sarcoma 
virus, thymidine kinase, beta-actin, metallothionein, and 
the epidermal growth factor receptor gene isolated from a 
DiFi episome. 

25 

For the purpose of this invention, a polyadenylation 
site refers to the site at which a poly A tail (a stretch 
of 50 to 300 adenines) is added to the vector for 
efficient expression of a desired protein in a mammalian 
30 cell. Also, the phrase "splice site" refers to a 

bacterial processing site essential to remove introns 
incorporated into the bacterial plasmid. These components 
are essential for optimal expression of a desired protein. 

35 A further embodiment of this invention is an 

artificial chromosome consisting of a DNA segment derived 
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froiD a non-viral epls me, said segmen't c n'talning an 
origin for DNA replication, a DNA segment derived from a 
non-*viral episome, said segment containing a ONA sequence 
which confers upon said vector the ability to be stably 
5 maintained extrachromosomally in a cell transfected with 
said vector, a DNA segment containing a multiple cloning 
site, a DNA selectable marker segment conferring upon a 
cell transfected with said vector, the ability to survive 
in the presence of a selected compound or selected group 
10 of compounds, a DNA segment encoding bacterial components 
necessary for propagation of said vector in bacteria, a 
promoter, a polyadenylation site, a splice site, a DNA 
segment encoding a centromere and a DNA segment encoding a 
telomere. 

15 

Further in accordance for this invention is a 
substantially purified non-viral episome of human origin 
capable of stable extrachromosomal maintenance and of 
autonomous replication in a compatible maunmalian cell 
20 line. 



Further in accordance for this invention is a 
substantially purified episomal DNA segment containing an 
origin of replication* This invention further includes a 

25 substantially purified* episomal DNA segment containing a 

DNA sequence, which confers upon a vector including said 
segment, the ability to be stably maintained 
extrachromosomally in a cell transfected with said vector. 
Another embodiment of this invention is a substantially 

30 purified, episomal DNA segment containing both an origin of 

replication and a DNA sequence, which confers upon a 
vector including said segment, the ability to be stably 
maintained extrachromosomally in a cell transfected with 
said vector. 



35 
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Anotiier embodiment of this invention is a DMA s gment 
c ntaining an origin for ONA replication Is from an 
episome isolated from DlFl colorectal cell line. 



5 Another embodiment of this invention is a DNA 

sequence which confers upon said vector the ability to be 
stably maintained extrachromosomally in a cell transfected 
with said vector is from an episome isolated from DlFl 
colorectal cell line. 

10 

Another embodiment of this invention is a DKA segment 
containing an origin for DNA replication and a DNA 
sequence which confers upon said vector the ability to be 
stably maintained extrachromosomally in a cell transfected 
15 with said vector is from an episome isolated from DlFl 
colorectal cell line. 



The various techniques which have been successfully 
applied to the cloning and expression of many genes in a 
20 variety of host systems, employing many different 

promoters and vectors, are known to those skilled in the 
art of recombinant DNA technology and could be applied to 
the embodiments described herein. 

25 For the purpose of this invention, the phrase 

**operatively spaced with respect to a desired gene" is 
defined as the appropriate positional spacing required 
between the numerous cloning and expression vectors 
components described in this invention so as to allow each 

30 of the of components to achieve its desired function. 

These components are also dlrectlonally positioned 5' to 
3'. The appropriate spacing needed for efficient cloning 
or expression of a desired gene is determined for each 
individual vector. 



35 
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In ^erns of transfec^lng euJcaxyotlc cells with t:he8e 
iinlque cloning or expression vectors, the -transfecti n 
tochnigues are standard and known to those skilled in the 
art of recombinant DNA technology. In terms of 
5 transfecting cells with the iinique expression vector, this 
invention could also be applied for the production of 
stable cell lines which are, by definition, continuously 
producing the desired protein. The production of cell 
lines designed to continuously produce the desired protein 
10 has been described extensively in the literature, and is 
therefore known to those skilled in the art. 

CHARACTERI STICS OF THE DEPOSITED CEIX LTWE 
Cell line **DiFi" comprising cells obtained from the 
15 ascitic fluid of a colorectal tumor in a patient with 

Gwdner's syndrome, is available from the ATCC, accession 
# CRL 10576. This cell line retains 50 copies or more of 
extrachromosomal episomes, each of which contains at least 
one complete copy of the epidermal growth factor receptor 
20 gene. 

Fig. 1. In situ hybridization of DiFi cells with Ecra. 

A portion of a metaphase from DiFi cells stained with 
Giemsa (A) , fluorescence visualization of in situ 
25 hybridization using biotinylated EGFR as probe and 

counterstained with propidium iodide (B) , and a black and 
white print of the fluorescence pattern of in situ 
hybridization (C) . 

30 Fig. 2. Electrophoretic mobilization of E GFR genes by 

gamma irradiation. 
Autoradiogram of a Southern blot of a TAFE gel 
hybridized with 32P-labeled £SEB« Origin (o) is indicated 
at the top as is the direction of micpration. Plug samples 
35 1-8 were exposed to 0, 5, 10, 20, 40 80, 160, 320 Gray, 
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resp ctlv ly. Hybridization membranes ver exposed to 
film for 24 hrs* 

Fig. 3. Effect of gamma irradiation on the 
5 electrophoretic mobilization EGFR in A431, 

DiPi, and HeLa cells, 
Autoradiogram of a Southern blot of a TAF£ gel 
hybridized with 32P-labeled £SEB* Origin and direction of 
migration is as in Fig. 2. A431, DiFi and HeLa cell DNA 
10 plugs were irradiated with A. OGy, B. 10 6y, C. 40 Gy, D. 
160 6y. Autoradiographic exposure was extended to 72 hr 
in order to enhance sensitivity for detecting any 
fragments that might have migrated from the A431 plugs. 

15 Fig. 4. CHEF analysis of EGFR in gamma irradiated DiFi. 

Plugs containing DiFi DKA were exposed to 31.4 Gy 
prior to electrophoresis. The analysis of control (c) and 
irradiated (R) samples %m6 performed in duplicate. 
Approximate sizes of the observed fragments, in kbs, are 

20 indicated to the right. 

INTRODUCTION TO THE DISCLOSED INVENTIONS 
AUTONOMOUSLY REPLICATING, STABLY MAINTAINED 
MICROCHROMOSOMAL UNITS FROM HUMAN TUMOR CELL LINES 
25 In developing the invention, we elected to use stably 

maintained extrachronosomal units arising in some 
eukaryotic cell lines as starting material, because these 
units contain all the genetic regions required for 
autonomous replication and extrachromosomal expression. 
30 Those steps are described below. 

In initial studies, the episomes are isolated from 
the origin in a substantially purified form and the 
minimal essential elements for episomal replication and 
35 transcription are localized and isolated. Those elements 

are then ligated into a selected DNA molecule, together 
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with addi-tional DNA segments. Including, for example, 
selectable markers, multiple cloning site r sites, 
segments necessary for propagation in bacteria and/or a 
promoter enhancer, splice site and polyadenylatlon site. 

5 

Replication of nuclear DNA in eukaryotes appears to 
be under precise and reproducible control, such that it Is 
replicated only once in each S-phase, the DNA synthetic 
portion of each cell division cycle. In addition, each 
10 portion of the genome replicates at the same time in each 
S-phase, with expressing (transcribed) genes replicating 
eeurly and non-expressing and/or structural DNA replicating 
late« 

15 The genomes of proJcaryotes, viruses, and yeast 

contain DNA sequences called origins, that serve as sites 
for initiating cycles of DNA replication. By analogy, 
such sites define replicating units, or replicons, in 
eukaryotic cells such as human cells. 

20 

An accepted working hypothesis is that the eukaryotic 
nucleus is organized into structural domains in which the 
nuclear matrix plays an essential role in organizing 
chromatin structure and in regulating function. Support 

25 for this hypothesis comes from studies demonstrating that 

DNA replication, DNA repair, transcription and 
post-transcriptional processing are associated with the 
nuclear matrix* Other studies have shown that DNA 
polymerase, RNA polymerase XI, expressing and expressible 

30 genes, transcriptional enhancer sequences, topoisomerase 
II cleavage sites, topoisomerase II, and heterogeneous 
nuclear RNA (hnRNA) splicing complexes are highly enriched 
or specifically localized in the nuclear matrix. 

35 The fact that regulatory DNA sequences and the 

nuclear proteins with which they interact have not been 
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ideii'tif led is in part atteibu^able to the unmanageable 
size of chromosomes and the complexity of the genetic 
elements they contain. However, stable cell lines are 
occasionally established in which regions of specific 
5 genes have been amplified (Stark, Cancer Surveys, 5:1-23, 
1986) and occasionally are segregated into autonomously 
replicating components. These exist in the nucleus as 
episomes (200 kb - 800 kb molecules) and/or light 
microscope-visible double minute chromosomes (dmins, >1000 
10 kb) . 

This invention exploits these cell lines by isolating 
and investigating the structure and replication control of 
their extrachromosomal elements in order to identify DNA 

15 sequences required to ensure their autonomy for stable 

maintenance, replication and gene expression. This 
minimal essential structure should then provide the core 
structure with which to assemble a cloning and expression 
vector for genes exceeding sizes accommodated by cosmid 

20 vectors. 

Although the methodology described herein contains 
sufficient detail to enstble one skilled in the art to 
practice the present invention, a commercially availbale 

25 technical manual entitled MQiiECUlAR cliONlNG (Haniatis, et. 
al. , Cold Spring Harbor Laboratory, Cold spring Harbor, 
New York) may provide some additional details useful to 
assist practice of some aspects of this invention. 
Accordingly, this manual is incorporated herein by 

30 reference. 

The following examples are designed to illustrate 
certain aspects of the present invention. However, 
they should not be constmied as limiting the claims 
35 thereof. 
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EZMtPM 1 
SZroikGHROKOSOK&L AKFLZFICATIOH 
OF THB mXDEBMKL GROWTH FACTOR RECEPTOR GENE 
ZH A HUMAN COMN CARCZHOHA CELL LZNE 

5 This exaoiple describes the isolation and 

characterization of an autonomously replicating episomal 
iinit derived from a human colorectal carcinoma cell, 
established from ascites from a patient with Gardner's 
syndrome, designated "Dif i" (Bowman, et al, , in: 

10 Hereditary Colorectal Cancer. J. Utsunomiya and H. Lynch 

(£ds.)r Springer-Verlag, In Press, 1990). The invention 
is not limited to the **Difi** episome, however, for the 
basic procediures provided by the present disclosure should 
enable those of skill in the art to develop vectors from 

15 the episomes of other cells. 

DiFi cells were (l) successfully established in 
tissue culture, (2) shown to contain amplified EGFR genes 
and mRNA, and (3) cheuracterized cytologically to be near 
20 tetraploid with the presence of double minutes (dmin; 

Bowman et al. In Hereditary Colorec tal Cancer , J. 
UtstinoBiiya and H. Lynch (eds) , Springverlag, In Press, 
1990) • 

25 CELL LINES EMPLOYED AND CELL CULTURE CQNDTTTQNS 

A431 (obtained from Gary Gallick, H. D. Anderson 
Cancer Center) , HeLa and DiFi cells were maintained in 
Dulbecco^s medium supplemented with 5% fetal and 5% 
newborn calf serum. SW480 cells, a colon tumor cell line 

30 (established by Leibovitz, 1976 and obtained from Hark 

Blick, D. Anderson Cancer Center) were grown and 
maintained in L-15 medium containing L-glutamine and 
supplemented with 10% fetal calf serum, insulin (5ug/ml) 
and glutathione (I6ug/ml) . 

35 
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A. Characteristic of a human eoloreet:al cancey cell 
line fDiPi\ 

''DlFl'* colorectal carcinoma cell line represents one 
of the first cell lines to be established and 
5 characterized from a patient with Gardner syndrome. 

Malignant ascitic fluid cells were isolated from a 46 
year old female rectal cancer patient with Gardner 
syndrome and initiated to grow in cultiire. The cells have 

10 been maintained in culture for over three years. Hoechst 

stain analysis for mycoplasma was negative. Subcutaneous 
injection of DlFi cells into athymic mice demonstrated 
tumor production in 50% of the mice. The cells have a 
tetraploid karyotype, and possess an isozyme pattern 

15 characteristic of colorectal cancer cell lines. 

IfPCMflZATION OF EGFH PNA IN DiFi bv tm ^ztth 

HYBRIDIZATTQN 

The following studies demonstrated the episomal 
20 location of the amplified EGFR gene. 

Slides containing metaphase cells from either DiFi or 
SW480 cells were prepared and stored at room temperatiire. 
Prior to sjtu hybridization with a biotinylated EGFR 

25 probe, the slides were stained (six minutes in 5% Giemsa 

prepared in phosphate buffer pH 6.8) and photographed. In 
Sifeu hybridization involved treating the photographed 
slides with RNAse, DKA denaturation and dehydration 
solutions, overnight incubation in a hybridization mix 

JO containing a biotinylated EGFR probe, and tagging the 

regions of SSEB hybridization with f luorescein-avidin and 
biotinylated goat anti-avidin. This procedure resulted in 
a three layers of f luoresceln-avidin, and visualization by 
fluorescence microscopy (Pinkel et al., Proc. Na-bl. Acad, 

15 Sci. USA. 83:2934, 1986). 
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The EGFR cDNA probe, HER-A64-3 (Ullrich et al., 
Natiure 309:418-425, 1984), was labeled by nick tran8la<bion 
witii bio^in-7-dATP according -to tbe instructions provided 
by BRIj. Hybridization mix (25 ul) containing 10% PEG 6000 
5 and 5 ng of probe was used on each slide. Following in 
situ hybridization and fluorescence labeling procedures, 
slides were rinsed and counterstained in propidiiun iodide 
(2 ug/ml in H2O) for two minutes, rinsed with H^O, and 
carefully blotted dry. Two drops of antifade solution 
10 (Johnson and Aroujo, J. Immunol. Methods 43:349-350, 1981) 

were added to each slide before covering with a coverslip, 
Metaphase chromosomes were photographed under epi-UV* 
illtmination on Kodak Ektachrome 160 film using the Zeiss 
filter 25 combination 48 77 09. 

15 

Giemsa-stained metaphase chromosomes from DiFi cells 
revealed a background of extrachromosomal particles at the 
limit of optical resolution (Fig. lA) . Occasionally, they 
were paired in the form of small dmins. To determine 

20 whether these structures contained copies of the EGFR 
gene, the biotinylated A64-3 cDNA EGFR probe was 
hybridized to these metaphase cells. SW480 cells served 
as a negative control because their dmins are amplified 
for HYC rather than EGFR (Untawale, Masters Thesis on File 

25 at the Graduate School* of Biomedical Sciences, University 
of Texas Health Science Center, Houston, Texas, 1987; 
Untawale and Blick, Anticancer Res. 8:1-8, 1988). 
Thirty-five SW480 metaphase cells were examined for 
hybridization with biotinylated A64-3 cDNA EGFR probe. No 

30 hybridization was observed to any metaphase chromosome or 

extrachromosomal entity (data not shown) . The same 
analysis was performed with DiFi metaphase spreads and 
thirty-three out of sixty-six demonstrated strong 
hybridization to extrachromosomal regions. No conclusions 

35 could be drawn from the remaining thirty-three metaphase 
cells due to weak hybridization or high background. 
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Flgur 1 presents in situ hybridization of DlFl 
metaphase cells with EGFR probe. A portion of a metaphase 
spread from DlFi cells was stained with Giemsa (lA) * 
Fluorescence visualization of in situ hybridization using 
5 biotinylated egfr as a probe and counterstained with 

propidium iodide is shown in IB, and a black and white 
print of the fluorescence pattern of in situ hybridization 
is shown in IC. 



10 In the GeinLsa stained metaphase (lA) the chromosomes 

are intensely stained in contrast to the diffuse staining 
of extrachromosomal material in the background. The 
extrachromosomal background appears to be dmin, which vary 
in their size and visibility. Hybridization of the 

15 biotinylated EGFR probe (yellow fluorescence) was limited 
to extrachromosomal regions containing dmln, rather than 
chromosomal DNA (IB) . In ordwr to emphasize the 
extrachromosomal hybridization the photograph was printed 
in black and white (IC) . In Figure IC, the 

20 extrachromosomal labeling was visualized more clearly 

since the fluorescein fluorescence is more intense in dmin 
than isothe propidium fluorescence from the chromosomes. 



Therefore, in situ hybridization of the biotinylated 
25 EGFR probe in the DlFi cell line demonstrated localized 

hybridization predominantly In extrachromosomal regions 
rather than to chromosomal DNA. 



The in situ hybridization analysis presented In 
30 Figure IB and ic consistently demonstrated specific 

biotinylated EGFR localized in the extrachromosomal 
background. This specific localization is most likely 
associated with episomes many of which are too small in 
size and disorganized in structure to be visualized as 
35 dmlns in stamdard cytogenetic spreads. 
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TTT. PREPARA TTQK AND IRRADIATION OF PNA 

Af'ter confirming that: "the EGPR ainplif ication observed 
In the DiFi cells was mediated by a stable episomal 
fraction, we next sought to isolate that fraction from the 
5 cells using the procedures described below. 

Cells were embedded, lysed and deproteinized in 
agarose blocks in order to minimize shear damage to the 
DNA (Smith et al.. In Methods in Enzvmoloav. Gottesman 

10 (Bd.)/ Academic Press, San Deigo, Vol 151, p. 461,, 1987). 

Agarose blocks, with each sample containing approximately 
3 ug of DNA, were cut to fit gel slots « Samples were 
suspended in 1 ml of TAFE buffer (10 nflf Tris-acetate, pH 
8,0; 0.5 mM EDTA) in 12 x 75 mm polystyrene culture tubes 

15 and exposed to ^Cs gamma rays at a dose-rate of 45 
Gray/min to linearize the DNA for pulse field 
electrophoresis (van der Blick et al., NAR 16:4841-4851, 
1988; Beverly, NAR 16:925-939, 1988; Ruiz et al., Mol. 
Cell. Biol. 98:109-115, 1989). The inventors exposed 

20 agarose plugs containing unsheared DiFi cellular DNA to 

varying doses of gsunma radiation prior to analysis by 
pulse-field gel electrophoresis. Appropriate levels of 
exposure were estimated based on an expected yield of 1*1 
X 10"* double-strand breaks/Gy/bp (calculated from Krisch 

25 et al., Rad. Res. 101:356-372, 1985). 

IV. PUI.SED-FIELD GEL ELECTROPHORE SIS WAS EMPLOYED TO SIZE 
DNA 

Following irradiation, the samples were loaded into 
30 1% agzurose gels and subjected to transverse alternating 
field electrophoresis (TAFE) using TAFE buffer in a 
GeneLine system (Beckman Instruments) • Agarose plugs 
containing yeast chromosomes or concatemers of lambda 
phage DNA were included on gels as size standards. 
35 Initial current was held constant at 170 ma for 30 min, 

reorienting the direction of the electrical field every 4 
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sec^ follow d by a constant current of 150 mA f r 18 hr 
with a field reorientation interval of 60 sec. 

Some experiments ei^loyed the clamped homogeneous 
5 electrical field (CHEF) protocol for pulsed-f ield gel 
electrophoresis (Caiu et al-, SSXSnSS. 232:65-68, 1986). 
Here, electrophoresis was performed in 0.5x TBE buffer (45 
mM boric acid, 45 mM Tris and 2 mM EDTA, pH 8.3) at a 
constant current of 70 volts reoriented every 15 min for a 
10 total of 3 days. 

fiQTTTOERN TpftMfiPgR AND HYBRTDIZATIQH 

tjpon completion of electrophoresis, staining (0.5 
ug/ml ethidium bromide) , and photography, gels were 

15 irradiated for 5 min with 254 nm UVL (Gelman Instrument 

Co., Model 51438). This was followed by gentle shaking in 
0.25 M HCl for 5 min for depurination, rinsing in 
deionized water, soaking in 0.4 M NaOH for 1 hr for 
hydrolysis of depurinated bases, rinsing in deionized 

20 water, and soaking in 0.2 M NaOH, 0.6 M NaCl for 1 hr for 

denaturation. The DNA was transferred to a Zetabind nylon 
membrane (AMF Cuno, Inc.) in the denaturing solution for 
15-20 hrs. The filter was then treated with two 15 min 
washes in a neutralizing solution (0.5 M Tris-HCl, pH 7.5; 

25 1.5 M NaCl) and dried in a vacuum oven at 80«C for 1 hr. 

Labeling of probe, hybridization to filters and 
autoradiography for visualization of fragments were 
performed as previously described (Amasino, An^l. pj^Qcft^ig. 
152:304-307, 1986; Liu et al.. Science 246:813-815, 1989). 

30 

Figure 2, an autoradiogram of a Southern blot of a 
TAPE gel probed with 32P-labeled EGFR , demonstrates 
electrophoretic mobilization of EGFR genes by gamma 
irradiation. The origin (o) as well as the direction of 
35 migration is indicated at the top of the figure. Plug 

samples 1-8 were exposed to 0, 5, 10, 20, 40 80, 160, 320 
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Gy, respectively. Hybridization membranes were exposed to 
film f r 24 hrs. 



Southern analysis of a gel hybridized with an EGFR 
probe demonstrated the dose dependent migration of two 
different sized fragments containing KGFR secpiences (Pig. 
2) . The pattern of migration of total DNA was observed by 
staining gels with ethidium bromide (data not shown) . 
Dose-dependent increases were observed in the amount of 
random sized DNA fragments migrating between the sample 
well and the front of each lane. Increased amounts of DNA 
also accumulated in the zone representing molecules of 
2500 kb or larger under the electrophoresis conditions 
employed. The JSSEE-containing fragments migrated at a 
position consistent with approximately 650 kb and 1300 kb 
representing faster and slower migrating forms, 
respectively. The origin is indicated by "O." 

Figure 3, an autoradiogram of a Southern blot of a 
TAFE gel probed with 32P-labeled Esm, demonstrates the 
effect gamma irradiation has on the electrophoretic 
patterns of migration of EGFR secpiences in A431, DiFi, and 
HeLa cells. The origin and direction of migration are as 
in Fig. 2. DNA plugs from A431, DiFi and HeLa cells were 
irradiated with increasing amounts of radiation: Lane 
(A): 06y; Lane (B) : 10 Gy; Lane (C) : 40 Gy; Lane (D) : 160 
Gy. The autoradiographic exposiire was extended to 72 hr 
in order to enhance sensitivity for detecting any 
fragments that might have migrated form the A431 plugs. 



Dose dependent increases were observed in the amounts 
of randomly broken DNA fragments migrating from sample 
wells into each lane. As is observed , EGFy amplification 
is much higher in DiFi DNA and A431 DNA when compared to 
HeLa DNA. More importantly, sample plug irradiation did 
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not release discrete sizes of HeLa and A431 EGFR sequences 
were (c nflrmed by exposing autoradlograms for 7 days, 
data not shown) • However, moblllzatior of both the 650 kb 
band and 1300 kb band DlFi £SSEB fragments were readily 
detected. To summarize, EGFR sequences in both HeLa and 
A431 DKA appear to be chromosomally localized. In 
contrast, EGFR sequences in DlFl DNA appeau: to be 
episomally (extrachromosomally) localized and may be 
substantially purified by the procedure described here. 



Figure 4 presents CHEF analysis of EGFR from gamma 
irradiated DlFi DKA. Plugs containing DlFl DNA were 
exposed to 31.4 Gy prior to electrophoresis. The analysis 
of control (c) and irradiated (R) samples was performed in 
15 duplicate. Approximate sizes of the observed fragments, 
in Icbs, are Indicated to the right. Irradiating DlFi 
plugs and conducting CHEF electrophoresis under conditions 
that resolve larger DNA fragments revealed the presence of 
a weakly hybridizing band of approximately 2,000 kb, in 
20 addition to the 650 kb and 1300 kb fragments (Fig. 4) . In 

unirradiated control lanes (C) a small portion of 
BSHS-containing molecules were observed to have migrated 
into the gels. This observation was previously attributed 
to degradation of cellular DNA during the preparation of 
25 agarose plugs (van der Blick, et al., nag 16:4841-4851, 

1988) . 

VI. SUMMARY 

J,j\ sjty hybridization, using a biotinylated cDNA 
30 probe for the epidermal growth factor receptor ( EGFR ) 

gene, demonstrated that amplified EGFl^ in colon tumor cell 
lines, DlFi, is localized to many small double minute 
chromosomes of varying size and visibility. Analysis of 
the electrophoretic mobility of gammairradiated DNA from 
35 DlFl by pulsed-fleld gel electrophoresis and Southern blot 
hybridization using EGFR probe. Indicated that the 
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amplified ESSE in DiFi exists in xtrachromosomal , 
covalent:ly-closed circular episomes, probably equivalent 
to dxoin. Tvo aiajor and one minor species were observed 
having estimated sizes of 650 kb, 1300 Wb, and 2000 kb. 
5 The DiFi cell line appears to represent a unique case of 
extrachromosomal £SEfi gene amplification in human cells. 
DiFi represents the first example of a stably maintained 
episome in which EGFR is amplified* 

10 gmPftB 9 

COHBTRUCTIKO A MMOIMiIAM BPISOMAL 
BXPRBSfiZOH OR CLONZHa VECTOR 

The identification, characterization and isolation of 
DNA regulatory regions within the episomes that function 

15 a) as origins of autonomous DNA replication, and b) 
function as stabilizing regions for extrachromosomal 
maintenance will permit the construction of cloning and 
expression vectors that replicate and fxmction as 
extrachromosomal vectors. The following is meant to serve 

20 as one exan^le of identifying and isolating such 

regulatory factors from the episomal unit maintained in 
human tximor cell. In some instances, reference is made t 
working with the episomal unit from DiFi cells; DiFi is 
used here only as an example. 

25 

1^ IDENTIFICATION AND ISOIATION OF REGtTTA TORY ELEMENTS 
IW STAPLE EPISOMAL UNITS ESTABLISHED I N HUMAN TUMOR 
CELh TWINES 

AuL gpj^Qme I^Qlfltjon 
30 In order to identify and isolate replication 

regulatory elements from an episome, the episome itself 
must first be isolated. 



35 



The ideal starting point is a preparation that is 
highly enriched for the episomes of interest. A highly 
enriched source of fiSEfi-containing episomes is the human 
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OlFi cell line. DNA will h Isolated from this enriched 
preparation and most of the DiFi genomic DNA can be 
eliminated from this preparation by employing an alkaline 
lysis modification (Griffin, et al., J> Virol. , 40:11-19, 
5 1981) . An essentially pure preparation of DiFi episomes 
can then be obtained by preparative electrophoresis on 
agarose gels that permits the mobilization of covalent 
circular DNA molecules (Carroll et al., Mol, Cell. Biol- , 
7:1740-1740 (1987)). These r^olecules can then be 
10 recovered from the gels by procedures that dissolve or 

digest (agarose) the agarose and permit the episomal DNA 
to be piirified directly from the digest* 

Determine a Restriction Ma p of the Episomal 
15 Genome. 

A restriction enzyme analysis will be performed after 
the episome is isolated. For example, most of the DiFi 
episome can be separated into two pieces by exploiting the 
limited number of sites susceptible to restriction enzymes 
20 Mlul (2 sites) and Not! (2 sites) • Hlul cuts at two 
closely spaced sites whereas NotI cuts at two widely 
distant sites. Table 1 presents macrorestriction fragment 
sizes of DiFi episomes digested with Mlul and NotI 
restriction enzyme. 

25 

TABLE 1 

MACRORESTRICTION FRAGMENT SIZES OF EPISOMES 

DIGESTED WITH Mlul AND NotI RESTRICTION ENZYME 

Restriction Enzyme Fragment Slzeg 

30 Mull - 50 kb, - 600 kb 

NotI - 270* kb, - 380** kb 

Mlul + NotI -50 kb, -220 kb, -380 kb 

* The 3^ end of this fragment contains the 5' 
35 untranslated region, exon I, and the 5' end of 

intron I of the E6FR gene. 
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** Til 5' end f 'this fragment, contains the 

remainder f the EGFR gen from intron I through 
the 3' terminus of the gene. 

Digestion of total DiFi DNA with Mlul and 
electrophoresis on agarose gels using a pulsed field gel 
electrophoresis format (Chu et al., ggjlence 232:65*68, 
1986) permits isolation of the region in the gel 
containing DNA fragments of -600 kb. Digesting the 
agarose plugs with NotI further reduces the size 
distribution pertaining to genomic DNA and also cleaves 
the DiPi episome into its expected fragments. This 
protocol yields identifiable and highly enriched DiFi 
episomal fragments on a background of digested genomic 
DNA. The individual episomal Noti fragments (-220 and 
-380 kb) are concentrated by electrophoresis in a second 
dimension, and then recovered from the gel by procedures 
that dissolve or digest agarose, thereby allowing 
purification of the desired DNA fragments for cloning. 

C . construction of DiFi Eois ome Recombinant DNA 
JUiJorarjes 

1. Lambda Libraries 

Lambda libraries were constructed that represented 2 
to 10 kb portions of the DiFi episome by utilizing 
partially restriction enzyme digested episomes or NotI 
fragments and the Lambda-Zap phagemid vector (Short, 
Fernandez, Sorge, and Huse, Nuc. Aci ds Res. 16:7583-7600, 
1988) . 

2. Cosmid Libraries 

Cosmid libraries are constructed with BamHI peortial 
digests of isolated episomes or NotI DiFi episomal 
35 fragments by utilizing the sCosl vector (Evans, et al. 

Gene , 79:9-20, 1989). These cosmid libraries represent 
portions of the DiFi episome in approximately 40 kb 
blocks • 



10 



15 



25 



30 
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3. PI Llbrarias 

Recombinant DNA libraries containing portions of the 
DiFi episome are constructed by utilizing the PI 
bacteriophage based cloning vector (Sternberg, Proffn Mtn 
5 Aead- sci, USA . 87:103-107, 1990). This PI library 

contains DiFi eplsomal portions representing two size 
ranges: less than 30 kb and approximately 85 - 110 kb. 

4. Plaaaid libraries 

10 Recombinant OKA libraries containing portions of the 

DiFi episome are constructed utilizing an E, coli F sex 
factor based cloning vector (Leonardo and Sedivy, 
Biotechnology . 8:841, 1990). This F plasmid library 
contains DiFi episomal portions up to at least 150 kb. it 

15 should be understood that other plasmid libraries can be 

constructed using one of several available plasmid vectors 
(i.e. pKS, pT7\T3a-18, etc.)- These vectors are known to 
those skilled in this art. 

20 Tdentif ication of Functional Regions Within 

Episomes Regulat ing DWA Replication 
In order to identify distinct episomal regions for 
replication, various portions of recombinant DiFi episomal 
DNA libraries (from the above section) are first 

25 introduced into appropriate mammalian host cells (Krysan, 

et. al., MQl, cell . Biol.. 9(3):1026, 1989). Autonomously 
replicating segments from the DiFi episome are first 
identified and the isolated segment is incorporated into a 
cloning or expression vector. Any trans faction method may 

30 be employed for introducing portions of the recombinant 

library into mammalian host cells (i.e. calcium phosphate 
transfection (Chen and Okayama, Molec. Cell. Biol. 
7:2745-2752 (1987)); electr operation (Chu et al., Nucl 
Acids Res. 15:1311-1326 (1987)). 



35 
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For exan^le, p ols o£ approximately 10 different 
plasmid vector clones from the DiFi Cosl library are 
introduced into for example, HSF56 human primary 
f itaroblaet cells via calcium phosphate transf action or 
5 electroporation. Each cosl vector clone contains a 

selectable marker that confers drug resistance to 6418, 
for example. Retention and replication of transf acted 
clones are identified by growing the transfected 
population of HSF56 cells in the presence of G418, a 

10 compound which specifically selects for cells that are 
neomycin resistant* The cells are placed under 6418 
selection 2 days after tremsfection, and G418 reslstamt 
populations are grown for at least two months by 
maintaining the resistant clones appropriate subculturing 

15 techniques known to those skilled in the art of tissue 

culture. 

Neomycin resistant clones that persist for several 
cell divisions therefore contain a DiFi Cosl vector clone 

20 that is replicating* A persistent neomycin resistant cell 

clone is recovered and low molecular weight DNA (less than 
120 kb) is isolated by the HIRT extraction method (Hirt, 
J. Mol. Biol. ■ 26:265-369, 1967). The DNA isolated from 
this neomycin resistant cell clone will be subcloned into 

25 plasmid vectors that accommodate smaller inserts, such as 

the pKS vector or the pT7/T3a-18 vector, which, 
preferably, will also contain a selectable marker, such as 
a gene encoding beta lactamase, which confers resistance 
to ampicillin. 

30 

The result of this will be another plasmid libreury 
which includes specific regions, one or more of which 
contain an origin for DNA replication. The clones from 
this new library will next be introduced into bacteria and 
35 bacterial colonies resistant to, for example, ampicillin, 

will be isolated. In a preferred embodiment, the host is 
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an Et C9li cell of a type which is compatible with the 
vector type. 

To determine if the DNA in the bacterial colonies 
5 contain an origin of DNA replication, the DNA from the 
aapicillin resistant bacterial colonies will be 
trmsfected into a mammalian cell line. The DNA (isolated 
with the HIRT extraction method) from the transfected 
mammalian cells will be emalyzed by the Dpn I digestion 
10 (Krysan et al., Molec. Cell, Biol. 9:1026-1033, 1989 which 

is incorporated herein by reference) . DNA exhibiting the 
bacterial methylation pattern is cleavable by Dpn I 
restriction enzyme while DNA with mammalian methylation 
pattern is not. Thus, DNA that is not digested by Dpn 1 
15 has replicated in the mammalian cell. The origins for DNA 
replication will then be identified within the inserts in 
autonomously replicating clones. The origin can then be 
removed from the vector, and inserted into the recombinant 
cloning vector. Vectors that include regions from the 
DiFi episome are designated pDFE ori* and will serve as 
the recipients for inclusion of other regions of the DiFi 
episome conferring episome maintenance. 



20 



£>. Identification of Function^ ^l Regions Within 
25 EPjsppes Regulating Extrachromosomal M a iniienanc^ 

Identifying those individual clones that contain a 
region conferring extrachromosomal stability is determined 
by long term culturing (longer than two months) in the 
presence of a selection drug. The clones that survive the 
30 continuous exposure to the selection drug must contain a 
region that confers extrachromosomal stability. 

Briefly, clones that persist during several cell 
division cycles will also be evaluated to identify regions 
35 within episomal DNA that confer stability for maintenance 
of extrachromosomal molecules. The procedxire by which 
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Is latlon of "this region is essentially the same one as 
described for identifying the replication region, except 
that vectors containing DiFi episonal origins of 
replication will be used to clone other restriction 
fragments from the DiFi episome. Once the first round of 
drug resistant cell colonies are identified, the episomal 
DHA may be isolated and introduced into bacteria and 
bacterial colonies resistant to, for example, ampicillin, 
will be isolated. 



To determine if the DNA in the bacterial colonies 
contain a region conferring extrachromosomal stability, 
the DNA from the ai^iclllin resistsmt bacterial colonies 
will be transf acted into a mammaliem cell line. The DNA 
15 (isolated with the HXRT extraction method) from the 

transfected mammalian cells will be analyzed for fragment 
size and, depending on that size, another cycle may be 
initiated to further reduce the size of the piece of DNA 
that confers the extrachromosomal steUbillty. 

20 

Xn addition to evidence for extrachromosomal 
stability that is provided by the vector's provision of 
drug resistance, the intranuclear localization of vector 
episomes will be evaluated. Vector-containing cells are 

25 treated with the non-ionic detergent Triton X-100 and 2M 
NaCl. This treatment produces salt extracted residual 
nuclei, called nucleoids, which can be centrlfuged into a 
pellet at low speeds. Vectors associated with the nuclear 
matrix will pellet with the nucleoids; if they do not 

30 pellet with the nucleoids they will remain in the 
extracts' supernate. 

Both the region for the origin of DNA replication and 
for extrachromosomal maintenance will be defined as the 
35 core structtures of both the cloning and expression vector 

and will be designated PDFE orl* mat*. 



\ 
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Construction of optimal EukarvoHic cloning 
Vector to Accommodate 40 kb - dOO kb Pieces of 
DNA. 

Once the core structure is determined, construction 
5 of the optimal eukaryotlc cloning or expression vector 
will be completed. This is accomplished by adding the 
following three features to the core structiure (these will 
be discussed below) : 

10 a. a DNA or genomic DNA region encoding at least 

one selectable marker; 

b. a DNA or genomic DNA region encoding a multiple 
cloning site; and 

15 

c. a DNA or genomic DNA region encoding bacterial 
components necess£ury for propagation of the 
vector in bacteria. 

20 Selectable markers, for mammalian cells, confer 

resistance to a specific selection agent once DNA 
conferring the resistance is transfected into individual 
cells possessing a genetic inheritance pattern appropriate 
for the selectable marker being used in the vector. There 

25 are a variety of different dominemt and recessive 

selection agents known to those skilled in the art. Any 
one of the following genes and agents should be effective 
in terms of employing a selection system: 

30 □ G418 resistance is selected by exposure to 

medium containing 100 to 800 ug/ml G418. G418 
selects for cells deficient in the enzyme 
aminoglycoside phosphotransferase and are 
referred to as neomycin resistant cells. 

35 (Southern and Berg, J, Molec, Annl. Gen., 
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1:327-341, 1982; Colbere-Garapin et al., 
MQlee. Biol.. 150:l, 1981). 

HAT resistance for forward selection 
(converting a thymidine Icinase minus cell to a 
thymidine kinase positive cell) is selected with 
complete medium supplemented with 100 uH 
hypoxanthine, 0.4 uM aminopterin, 16 uM 
thymidine and 3 uM glycine. HAT medium selects 
for variants defective in either 
hypoxanthine-gu6uiine phosphoribosyl-transf erase 
or thymidine kinase (Littlef ield, Proc, Natl. 
Acad, Sei. USA . 50:568, 1963; Littlefield, 
science . 145:709-710, 1964). 

Hygromycin B resistance is selected by 
exposure to complete medium supplemented with 10 
- 400 ug/ml hygromycin B. Hygromycin B selects 
for variants defective in the enzyme 
hygromycin-B-phosphotransf erase (Gritz and 
Davies, Gene . 25:179-188, 1983; Santerre, et 
al.. Gene . 30:147, 1984; Palmer, et.al., Proc. 
Natl. Acad. Sei. USA. 84:1055-1059, 1987). 

Adenine phosphoribosyltransf erase (APRT) 
positive variants are selected by exposure to 
medium supplemented with 25 iiM alanosine, 50 uH 
azaserine and 100 uH adenine (Lowy, et. al., 
cell , 22:817, 1980; Adair, et. al., Proc. Natl. 
Acad. Sci. USA . 86:4574-4578, 1989). 

Xanthine - Guanine 
Phosphoribosyltransf erase (XGPRT) positive 
variants are selected with complete medium 
supplemented with dlalyzed fetal calf serum, 250 
ug/ml xanthine, 15 ug/ml hypoxanthlne, 10 ug/ml 
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thymidlne, 2 ug/ml aminopterln, 25 ug/ml 
aycophenolic acid, and 150 ug/ml Xj-glu'baaiine 
(Mulligan and Berg, Proe, Matl. Aoad, Sci, USA. 
78:2072-2076, 1981). 

5 

a Methotrexate resistance is selected by 

exposure to complete medium supplemented with 
0*01 iiM - 300 uH methotrexate and dialyzed fetal 
calf serum. Methotrexate selects for cells 
10 expressing high levels of dihydrof olate 

reductase (O'Hare, et al., Proc. Natl. Acad. 
Sci. USA . 78:1527, 1981; Simonsen and Iievinson, 
P^Q. Watl. Acad- Sci. USA. 80:2495-2499, 1983). 

15 Q Deoxycoformycin resistant cells are 

selected by exposure to complete medium 
supplemented with 10 ug/ml thymidine, 15 ug/ml 
hypoxanthine, 4 uH 9-B-D- xylofuranosyl adenine 
(XylA) , amd O.Ol - 0.03 uM 2 '-deoxycoformycin 

20 (dCF) • This selection selects for mutants 

expressing adenosine deaminase (ADA; Kaufman, 
et. al., Proc. Matl. Acad. Sci> USA. 
83:3136-3140, 1986). 

25 For added ease in handling and manipulating, this 

optimum eukaryotic cloning vector could include a DNA 
region comprising a multiple cloning cassette secpience 
containing infrequent cutting by restriction enzymes to 
facilitate the insertion of a desired gene. Multiple 

3 0 cloning cassette sequence caortridges are commercially 
available from several different companies (Stratagene , 
Promega, New England Biolabs etc). A typical cassette 
sequence cartridge would incade restriction sites for 8 - 
11 different enzymes (i.e. Eco Rl, Sacl, Sma l, Ava 1, Bam 

35 HI, Xba 1, Hinc II, Acc 1, Sal 1, Pst 1, Hind III, etc.). 
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The availability of these cassette cartridg s are known to 
those skilled in the art. 

The bacterial plasmld sequences nay be derived from 
5 any one of the many different vectors that are 

commercially available and known to those skilled in the 
art of recombinant DNA technology. For the piirpose of 
this invention, pUC, pKS, pBR322 and pT7/T3al8 are used as 
a matter of preference, however, other vectors would be 
10 equally effective. For example, if pBR322 sequences are 
introduced into the cloning or expression vector, the 
resulting recombinant can then be shuttled back and forth 
between E. coli and mammalian cells. 

15 The construction of an optimal eukaryotic expression 

vector that can accommodate 40 kb - 400 kb pieces of DNA 
will also contain, in addition to the elements described 
for the cloning vector, a DNA region containing a 
promoter, a polyadenylation and splice site necessary for 

20 the expression of the desired gene. 

There are at least two approaches for constructing an 
optimal eukaryotic cloning vector that can accommodate 40 
- 400 kb pieces of DNA. 

25 

1. The first and more simpler approach is to begin 
with a readily available cloning plasmld vector 
capable of propagation in bacteria. There are 
many different vectors known to those skilled in 

30 the art that would work efficlenty. Several 

different components and features can easily be 
ligated into this bacterial plasmld vector. 
These added features are discussed below. Once 
completed, the vector will not only have the 

35 core structure (to confer the ability to 

replicate DNA and to be maintained 
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extrachromos mally) but will also have the added 
features to optimize the vector for propagation 
in bacteria and for identification of its 
presence after transfection into a manmalian 
5 cell recipient. 

2. The second approach involves custom designing 
andcreating the optimum cloning vector by 
ligating all the desired feattires and components 
10 (including the core structure) together to 

generate the vector of choice. 

fi^ Construction of a Mammalian Artificial Chromosome 

The episomally maintained and replicated vector pOFE 
15 ori*^ mat^ is introduced into cells and persist as covalent 
circular extrachromosomal molecules. In this form the 
episomes accumulate to produce multiple copies in each 
cell and accordingly^ also overproduce mRNA and its 
protein product. While this is desirable for producing 
20 amplified genes and gene products, the introduction of 

cloned genes into cells for use in gene therapy requires 
the control of gene copy niimber and attendant gene 
expression. Such control is introduced into the DiFi 
episome vector by introducing DHA sequences that stabilize 
25 artificial chromosomes containing linear double stranded 
DNA (ONA encoding a telomere) • Such sequences occur at 
the termini of natural chromosomes; in human chromosomes 
5'-AGGGTT-3' is tandemly repeated to the extent of 10 of 
15 kb at every telomere (Blackburn, Science , 249:489, 
30 1990) . This tandemly repeated sequence is ligated to each 

end of a linearized cloning and expression vector to 
stabilize the termini. The addition of telomere sequences 
specific for other species provides for the stabilization 
of artificial chromosomes when introduced into those 
35 species. 
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Cente mere secpiences are known to Identify regions 
within chromosoines where kinetichores are organized and 
mitotic spindles are attached to the chromosomes, thus 
ensuring for the segregation of chromosomes during 
5 mitosis. DNA sequences that serve as centromeres are 
introduced into an internal region of the linearized 
cloning and escpression vector which contain telomeres 
resulting in an artificial chromosome. This synthetic 
chromosome contains required regulatory and stabilizing 
10 DNA sequences that normally occur in natural chromosomes. 

Specific genetic function is conferred on this 
synthetic chromosome by ligating a gene of interest into 
its multiple cloning site. For example, the gene or cDNA 
15 derivative of the gene that is defective in Duchenne's 

muscular dystrophy or myotonic dystrophy, or one of a 
number of other diseases associated with muscle 
dysfunction may be cloned into the artificial chromosome. 
The artificial chromosome is then introduced into cells or 
20 tissues or animals by methods appropriate for the target. 

The transfected chromosome is established as an integral 
component of the recipient cells where it is stably 
maintained and expressed. Recipient cells, tissues or 
animals that were initially dysfunctional because of a 
25 genetic defect they possessed are cured and become normal 
because of the expression and synthesis of the normal gene 
product introduced in the artificial chromosome. 

IL. Evaluation of Different St rategies for 

Transfectina Cloning or Expression Vectors Into 
Mammalian Cells 
Once the optimal cloning and expression vector is 
constructed, several different strategies for transfecting 
the vectors will be studied. Exaonples of potential 
methods includes: (i) encapsulation of insert-containing 
vectors in liposomes of appropriate composition to enhance 



30 



35 
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entry into target cells, and (2) electrop rati n of vect r 
into mitotic cell recipients to enhance its inclusion 
within the nucleus as cells progress into 61 phase of the 
cell cycle, and (3) injection of DNA-encoated particles 
5 into cells by exaploying a Biolistic Particle Delivery 
system (DuPont) , This procedure essentially shoots 
DNA-coated bullets into cells or tissues. 

X.. Blosvnthetic Production of Proteins in cellfi 
1<> Trangfected With Cloning and Expressio n Vectors 

containing Isolated Genes or Functional 
Derivatives 

Medically important proteins are produced in 
mammalian cells that have been transfected with the vector 
15 containing the gene encoding the protein. Since the 

gene-containing vector accumulates in the transfected 
cells, the amount of protein produced increases as more 
vector copies accumulate. The following example 
illustrates an efficient system for protein production. 
20 To produce the product of the gene that is deficient in 
patients with myotonic dystrophy, the vector containing 
the normal gene is electroporated into a normal primary 
human fibroblast cell line HSF56, adapted for growth in 
suspension culture in serum free medium. The accumulation 
25 of the cloning vector in each cell is accelerated by 

growing the cells in the drug appropriate for the drug 
resistance gene contained in the vector. As the gene copy 
number accumulates the amount of protein increases to be 
recovered from the culture medium or from the cells after 
30 maximal growth is achieved. The medical condition of 
patients with myotonic dystrophy may be improved by 
treatment with the protein that is provided by this 
cloningexpression system. Modification of the vector to 
include other genes and selection of target cells and 
35 appropriate culture conditions provides endless possible 
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sys'tems for the production and Isolation of mammalian 
proteins * 

The foregoing description has been directed to 
5 particular embodiments of the invention in accordance with 
the requirements of the Patent Statutes for the purposes 
of illustration and explanation. It will be apparent, 
however, to those skilled in this art, that many 
modifications and changes in the apparatus and procedure 
10 set forth will be possible without departing from the 

scope and spirit of the invention. It is intended that 
the following claims be interpreted to embrace all such 
modifications and changes. 



15 
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CIATMS 

1. A composition of ma-kter comprising a 
substantially purified non-vlral eplsome of human origin 
capable of stable extrachromosomal maintenance and of 
autonomous replication In a compatible mammalian cell 
line. 



10 2. A substantially purified eplsomal DNA segment 

containing an origin of replication. 



3. A substantially purified eplsomal DNA segment 
15 containing a DNA sequence which confers upon a vector 

1 auding said segment the ability to be stably maintained 
extrachromosomal ly in a cell transfected with said vector. 



20 4. A substantially purified eplsomal DNA segment 

containing an origin of replication and a DNA sequence 
which confers upon a vector Including said segment the 
ability to be stably maintained extrachromosomally in a 
cell transfected with said vector. 



25 



30 



5. The substantially purified eplsomal DNA segment 
of claim 2, 3, or 4 wherein the eplsomal DNA segment is 
from an eplsome Isolated from DlFl colorectal cell line. 



6. A cloning vector comprising the following 
components operatively spaced with respect to a desired 
gene: 



35 
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a) a DNA segment derived from a non-viral eplsome, said 
segmen't containing an origin for DNA replication; 

b) a DNA segment derived from a non-viral episome, said 
5 segment containing a DNA sequence which confers upon 

said vector the ability to be stably maintained 
extrachromosomally in a cell transfected with said 
vector; 

10 c) a DNA segment containing a multiple cloning site; 

d) a DNA selectable marker segment conferring upon a 
cell transfected with said vector the ability to 
survive in the presence of a selected compound or 

15 selected group of compounds; and 

e) a DNA segment encoding bacterial components necessary 
for propagation of said vector in bacteria. 



20 



25 



7* The cloning vector of claim 6 wherein said 
compoxind is selected from the group consisting of G418 and 
hygromycin B. 



8. The cloning vector of claim 6 further including 
a DNA sequence encoding a desired protein* 



30 9. The cloning vector of claim 6 wherein the 

segment containing the origin for DNA replication is from 
an episome Isolated from DlFl colorectal cell line. 



35 10. The cloning vector of claim 6 wherein the 

segment containing a DNA secpience which confers upon said 
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V ctor the ability to be stably maintained 

xtrachromosoinally in a cell transf ot d with said vector 
is from an episome isolated from DiFi colorectal cell 
line. 



11. The cloning vector of claim 6 wherein the 
chromosomal DNA of said transfected cell contains a 
mutation in an enzyme, said mutation rendering said 

10 selected compound or selected group of compounds toxic to 
said cell when said selectable marker segment is not 
present in said cell and wherein said selectable marker 
segment contains DNA encoding an enzyme capable of 
functionally replacing said mutated enzyme so as to render 

15 said transfected cell resistant to said selected compound 

or selected group of compounds. 



12. The cloning vector of clr Im 11 wherein said 
20 enzyme is selected from the group consisting of: thymidine 

kinase, xanthine-guanine phosphor ibosyl transferase, 
adenine phosphor ibosyl transf erase, adenosine deaminase and 
dihydrofolate reductase. 



25 



13. A cloning vector comprising the following 
components operatively spaced with respect to a desired 
gene : 



30 a) a DNA segment derived from a non-viral episome, 

said segment containing an origin for DNA 
replication and a DNA sequence which confers 
upon said vector the ability to be stably 
maintained extrachromosomally in a cell 

35 transfected with said vector; 
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b) a DNA s gm containing a zsultiple cloning 
site; 

c) a DNA segment conferring upon a cell transfected 
5 with said vector the ability to survive in the 

presence of a selected compound or selected 
group of compounds; and 

d) a DNA segment encoding bacterial components 
10 necessary for propagation of said vector in 

bacteria. 



14. The cloning vector of claim 13 wherein said 
15 compovmd is selected from the group consisting of 6418 and 

hygromycin B. 



15. The cloning vector of claim 13 further including 
20 a DNA sequence encoding a desired protein. 



16. The cloning vector of claim 13 wherein the DNA 
segment containing an origin for DNA replication and a DNA 

25 sec[uence which confers upon said vector the ability to be 

stably maintained extrachromosomally in a cell transfected 
with said vector is from an episome isolated from DiFi 
colorectal cell line. 

30 

17. The cloning vector of claim 13 wherein the 
chromosomal DNA of said transfected cell contains a 
mutation in an enzyme, said mutation rendering said 
selected compound or selected group of compoiinds toxic to 

35 said cell when said selectable marker segment is not 

present in said cell and wherein said selectable marker 
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10 



25 



30 



segment contains DNA encoding an enzyme capable f 
functionally replacing said mutated enzyme so as to render 
said transfected cell resistant to said selected compound 
or selected group of compounds. 



18. The cloning vector of claim 17 wherein said 
enzyme is selected from the group consisting of: thymidine 
kinase, xanthine-guanine phosphoribosyl transferase, 
adenine phosphor ibosyltransf erase, adenosine deaminase and 
dihydrofolate reductase. 



19. An expression vector comprising the following 
15 components operatively spaced with respect to a desired 
gene: 

a) a DNA segment derived from a non-viral episome, 
said segment containing an origin for DNA 

20 replication; 

b) a DNA segment derived from a non-viral episome, 
said segment containing a DNA sequence which 
confers upon said vector the ability to be 
stably maintained extrachromosomally in a cell 
transfected with said vector; 

c) a DNA segment containing a multiple cloning 
site; 



35 



a DNA segment conferring upon a cell transfected 
with said vector the ability to survive in the 
presence of a selected compound or selected 
group of compounds; 
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e) a DNA segment encoding baotzerial comp nants 
necessary f r pr pagati n f said vector in 
bacteria; and 

£} a promoter, a polyadenylation site, and a splice 
site in special relation to allov the efficient 
esqoression of a structural gene upon insertion 
of said gene into said splice site. 



20. The expression vector of claim 19 wherein said 
compound is selected from the group consisting of 6418 and 
hygromycin B. 



21. The expression vector of claim 19 further 
including a DNA sequence encoding a desired gene. 



22. The expression vector of claim 19 wherein the 
bacterial con^onents necessary for propagation of said 
vector in bacteria are derived from pBR322> pUC, pT7/T3a- 
18 or pKS. 



23. The expression vector of claim 19 wherein the 
segment containing the origin for DNA replication is from 
an episome isolated from DiFi colorectal cell line. 



24. The expression vector of claim 19 wherein the 
segment containing a DNA sequence which confers upon said 
vector the ability to be stably maintained 
extrachromosomally in a cell transfected with said vector 
35 is from an episome isolated from DiFi colorectal cell 

line. 
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25. The expression vector of claim 19 wherein the 
chroiaosomal DNA of said transf eoted cell contains a 
mutation in an enzyme, said mutation rendering said 
5 selected compound or selected group of compounds toxic to 
said cell when said selectable marker segment is not 
present in said cell and wherein said selectable marker 
segment contains DNA encoding an enzyme capable of 
functionally replacing said mutated enzyme so as to render 
10 said trcuisfected cell resistant to said selected compound 
or selected group of compounds. 



26. The expression vector of claim 25 wherein said 
15 enzyme is selected from the group consisting of: thymidine 

kinase, xanthine-guanine phosphoribosyl transferase, 
adenine phosphor ibosyltransf erase, adenosine deeuainase and 
dihydrofolate reductase. 

20 

27. An expression vector comprising the following 
components operatively spaced with respect to a desired 
gene: 

25 a) a DNA segment derived from a non-viral episome, 

said segment containing an origin for DNA 
replication and a DNA sequence which confers 
upon said vector the ability to be stably 
maintained extrachromosomally in a cell 

30 transfected with said vector; 

b) a DNA segment containing a multiple cloning 
site; 



35 



c) 



a DNA segment conferring upon a cell transfected 
with said vector the ability to survive in the 
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presence f a selected c mp und r sel cted 
group of compounds; 

d) a DNA segment encoding bacterial components 
5 necessary for propagation of said vector in 

bacteria; and 



e) a promoter, a polyadenylation site, and a splice 
site in spacial relation to allow the efficient 
10 expression of a structural gene upon insertion 

of said gene into said splice site. 



28. The expression vector of claim 27 wherein said 
15 compotind is selected from the group consisting of G418 and 

hygromycin B. 



29. The expression vector of claim 27 further 
20 including a DNA sequence encoding a desired protein. 

30. The expression vector of claim 27 wherein the 
bacterial components necesssury for propagation of said 

25 vector in bacteria are derived from pBR322, pUC, pT7/T3a- 
18 or pKS. 



31. The expression vector of claim 27 wherein the 
30 DNA segment containing an origin for DNA replication and a 
DNA sequence which confers upon said vector the ability to 
be stably maintained extrachromosomally in a cell 
transfected with said vector is from an episome isolated 
from DlFi colorectal cell line. 



35 
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32 • The expression vector of claln 27 r in the 
chromosomal DNA of said transf ected cell c ntalns a 
mutation In an enzyme, said mutation rendering said 
selected compound or selected group of compounds toxic to 
5 said cell ^en said selectable marker segment Is not 

present In said cell and wherein said selectable marker 
segment contains DNA encoding an enzyme capable of 
functionally replacing said mutated enzyme so as to render 
said transfected cell resistant to said selected compound 
10 or selected group of compounds. 

33. The cloning vector of claim 32 wherein said 
enzyme is selected from the group consisting of: thymidine 
15 kinase, xanthine-guanine phosphoribosyl transferase, 

adenine phosphoribosyl transf erase, adenosine deeminase and 
dihydrofolate reductase. 

20 34. The expression vector of claim 19 or 27 wherein 

the promoter is selected from the group consisting of: 
cytomegalovirus promoter, SV-40 promoter, Rous sarcoma 
virus promoter, thymidine kinase promoter, beta-actin 
promoter, metallothionein promoter, and epidermal growth 

25 factor receptor gene promoter isolated from a OlFl 

episome. 

35. An artificial chromosome comprising: 
30 a DNA segment derived from a non-viral eplsome, said 

segment containing an origin for DNA replication, a DNA 
segment derived from a non-viral eplsome, said segment 
containing a DNA sequence which confers upon said vector 
the ability to be stably maintained extrachromosomally in 
35 a cell transfected with said vector, a DNA segment 

containing a multiple cloning site, a DNA selectable 
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marker segm n't conferring upon a cell transf ected vltli 
said vector, the ability to survive in the presence of a 
selected compound or selected group of compounds, a DNA 
segment encoding bacterial components necessary for 
5 propagation of said vector in bacteria, a promoter, a 
polyadenylation site, a splice site, a DNA segment 
encoding a centromere and a DNA segment encoding a 
telomere. 



10 



15 



36. The artificial chromosome of claim 35 wherein 
said compound is selected from the group consisting of 
6418 and hygromycin B. 



37. The artificial chromosome of claim 35 further 
including a DNA sequence encoding a desired protein. 



20 38. The artificial chromosome of claim 35 wherein 

the chromosomal DNA of said tremsfedied cell contains a 
mutation in an enzyme, said mutation rendering said 
selected compound or selected group of compounds toxic to 
said cell when said selectable marker segment is not 

25 present in said cell and wherein said selectable marker 

segment contains DNA encoding an enzyme capable of 
functionally replacing said mutated enzyme so as to render 
said transfected cell resistant to said selected compoxind 
or selected group of compounds. 

30 

39. The artificial chromosome of claim 38 wherein 
said enzyme is selected from the group consisting of: 
lihymidine kinase, xanthine-guanine phosphoribosyl 
35 transferase, adenine phosphoribosyl transf erase, adenosine 
de£uninase and dihydrofolate reductase. 
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